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Abstract. Accounts of semantic phenomena often involve extending types of 
meanings and revising composition rules at the same time. The concept of monads 
allows many such accounts — for intensionality, variable binding, quantification and 
focus — to be stated uniformly and compositionally. 



1 Introduction 

The Montague grammar tradition formulates formal semantics for natural 
languages in terms of the A-calculus. Each utterance is considered a tree in 
which each leaf node is a lexical item whose meaning is a (usually typed) 
value in the A-calculus. The leaf node meanings then determine meanings 
for subtrees, through recursive application of one or more composition rules. 
A composition rule specifies the meaning of a tree in terms of the meanings 
of its sub-trees. One simple composition rule is function application: 

[xy} = M(M)--P where [z] : a - f3 and {yj : a. (24.1) 

Here a and f3 are type variables, and we denote function types by — 

To handle phenomena such as intensionality, variable binding, quantifi- 
cation and focus, we often introduce new types in which to embed existing 
aspects of meaning and accommodate additional ones. Having introduced 
new types, we then need to revise our composition rules to reimplement 
existing functionality. In this way, we often augment semantic theories by 
simultaneously extending the types of meanings and stipulating new com- 
position rules. When we augment a grammar, its original lexical meanings 
and composition rules become invalid and require global renovation (typi- 
cally described as "generalizing to the worst case" ( Partee 1996]) ). Each time 
we consider a new aspect of meaning, all lexical meanings and composition 
rules have to be revised. 

Over the past decade, the category-theoretic concept of monads has 
gained popularity in computer science as a tool to structure denotational se- 



mantics ( Moggi 199C ; Moggi 1991 ) and functional programs ( Wadler 1992a ; 
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Wadlcr 1992b| ). When used to structure computer programs, monads allow 
the substance of a computation to be defined separately from the plumb- 
ing that supports its execution, increasing modularity. Many accounts of 
phenomena in natural language semantics can also be phrased in terms of 
monads, thus clarifying the account and simplifying the presentation. 

In this paper, I will present the concept of monads and show how they 
can be applied to natural language semantics. To illustrate the approach, I 
will use four monads to state analyses of well-known phenomena uniformly 
and compositionally. By "uniformly" I mean that, even though the analyses 
make use of a variety of monads, they all invoke monad primitives in the 
same way. By "compositionally" I mean that the analyses define composition 
rules in the spirit of Montague grammar. After presenting the monadic 
analyses, I will discuss combining monads to account for interaction between 
semantic phenomena. 

2 Monadic analyses 

Intuitively, a monad is a transformation on types equipped with a composi- 
tion method for transformed values. Formally, a monad is a triple (M, rj,*), 
where M is a type constructor (a map from each type a to a corresponding 
type Ma), and r\ and * are functions (pronounced "unit" and "bind") 

rj : a -» Ma, ★ : Ma -> (a -» M/3) -» M/3. (24.2) 

These two functions are polymorphic in the sense that they must be defined 
for all types a and (3. Roughly speaking, r\ specifies how ordinary values 
can be injected into the monad, and * specifies how computations within 
the monad compose with each other .[] Some concrete examples follow. 

2.1 The powerset monad; interrogatives 

As a first example, consider sets. Corresponding to each type a we have a 
type a — > t, the type of subsets of a. We define^ 

Ma = a -> i Va, (24.4a) 

77(a) = {a} : Ma Va : a, (24.4b) 

ra-kk = |J aem k(a) : M/3 Vm : a -> t, k : a -> (3 -» t. (24.4c) 

1 By definition, 77 and * must satisfy ieft identity, right identity, and associativity: 

77(0) * k = k(a) Va : a, k : a -> M/3, (24.3a) 

m * 77 = m Vm : Ma, (24.3b) 

(m*fc) *l = m* (\v.k(v)*l) Vm : Ma, k : a -> M/3, / : /3 -» M7. (24.3c) 

2 In this section and the next, we treat types as sets in order to define the powerset 
and pointed powerset monads. These two monads do not exist in every modei of the 
A-calculus. 
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The powerset monad is a crude model of non-determinism. For example, 
the set of individuals mi defined by 

mi = {John, Mary} : Me 

can be thought of as a non-deterministic individual — it is ambiguous between 
John and Mary. Similarly, the function ki defined by 

ki : e -> M(e -> t) 
fci(a) = {Ax. like(a, x), Xx. hate(a, x)} : M(e — > t) 

maps each individual to a non-deterministic property. To apply the function 
k± to the individual mi, we compute 

mi * ki = Uae{John,Mar y }{ A2; - like (°> x )> Xx - hate(a, x)} 
= { Ax. like( John, x), Xx. hate( John, x) , 

Ax.like(Mary,x), Ax. hate(Mary, x)} : M(e -> i). 

We see that the non-determinism in both mi and fci is carried through to 
produce a 4- way- ambiguous result. 

Most words in natural language are not ambiguous in the way mi and 
ki are. To upgrade an ordinary (deterministic) value of any type a to the 
corresponding non-deterministic type Ma, we can apply rj to the ordinary 
value, say John: 

??(John) = {John} : Me, ^ 
{John} -kki = {Xx. like(John, x), Ax. hate(John, x)} : M(e — > t). 

Similarly, to convert an ordinary function to a non-deterministic function, 
we can apply n to the output of the ordinary function, say &)2 below: 

&2 = Act. Ax. like(a, x) : e — > e — > t, 
V ok 2 = Xa.{Xx. like(a, x)} : e -> M(e -» t), (24.6) 
mi * (77 o fc 2 ) = {Ax. like(John, x), Ax. like(Mary, x)} : M(e — > i). 

In both ( |24.5| ) and ( 24.6| ), an ordinary value is made to work with a non- 
deterministic value by upgrading it to the non-deterministic type. 

Consider now the function application rule ( 24.1 ). We can regard it as 
a two-argument function, denoted A and defined by 

A : (a -» (3) - a ^ 4 
= /(x) : V/:a^A x : a. 

We can lift ordinary function application A to non-deterministic function 
application ^4m, defined by 



A M : M(a -» /3) -> Ma - 
A M (/)W = / * [Aa. x * [A6. r/(a(b))]] : M/3 (24. 
V/ : M(a ->P),x: Ma. 
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Substituting ( g4jg ) into ( |24~8| ), we get 

A M (f)(x) = { 0(6) I o € /, 6 G x } C /3 V/Ca^ftiCa. (24.9) 

Just as the definition of ^4 in ( |24.7| ) gives rise to the original composition 
rule ( pil| ), that is, 

lxy}=A(l X })(M), (24.10) 
the definition of Am in ( 24.8; ) gives rise to the revised composition rule 

M = A M (H)(M)- (24.11) 

For the powerset monad, this revised rule is the set-tolerant composition 
rule in the alternative semantics analysis of interrogatives first proposed by 
Hamblin (1973). In Hamblin's analysis, the meaning of each interrogative 
constituent is a set of alternatives available as answers to the question; this 
corresponds to the definition of M in (24.4). By contrast, the meaning of 
each non-interrogative constituent is a singleton set; this corresponds to the 
definition of r/ in ( 24.4] ) . 

To support question-taking verbs (such as know and ask), we (and [Hai~ 
blin| ) need a secondary composition rule in which A is lifted with respect to 



the function / but not the argument x: 
lxyj=A' M (lx})(ly}) where 



A M : M(Mq -4/?)^Ma^ M/3, (24.12) 
A M {f)(x) = f * [Xa. 7](a(x))] : M/3 V/ : M(Ma -> P), x : Ma. 

Substituting ( pUf ) into ( |24~1~2D , we get 

= { a (%) \aef}Q(3 V/C(^^ftiC ai (24.13) 



Note that, for any given pair of types of [xj and [y] , at most one of ^4m (j24.8[) 
and ^4^| ( 24.12j ) can apply. Thus the primary composition rule ( |24.11| ) and 
the secondary composition rule ( 24.12 ) never conflict. 



2.2 The pointed powerset monad; focus 

A variation on the powerset monad (|24.4[) is the pointed powerset monad; it 
is implicitly involved in Rooth's ( 1996| ) account of focus in natural language. 
A pointed set is a nonempty set with a distinguished member. In other 
words, a pointed set x is a pair x = (xo,x\), such that xo is a member of 
the set x\. Define the pointed powerset monad by 



Ma = { (xq, xi) j xo £ x\ C a } 
77(a) = (a, {a}) : Ma 
m*k= ([A;(mo)]o,Ua G »n 1 [ A: ( a )]i) : 



Va, 
Va : a, 

Vm : Ma, k : a 



(24.14a) 
(24.14b) 
(24.14c) 
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This definition captures the intuition that we want to keep track of both a set 
of non-deterministic alternatives and a particular alternative in the set. As 
with the powerset monad, the definition of m-kk carries the non-determinism 
in both m and k through to the result. 

Substituting our new monad definition ( p4.14 ) into the previously lifted 
application formula ( |24.8[) gives 

^m(/o,/i)(^o,^i) = (/o(a?o),{a(6) I a € ft, b e xi }) : M/3 
V(/ 0) /i) :M(a->0), (x 0) xi) : Ma. 



This formula, in conjunction with the primary composition rule (24.11), is 
equivalent to [Rooth 's recursive definition of focus semantic values. 



Crucially, even though the pointed powerset monad extends our mean- 
ing types to accommodate focus information, neither our definition of ^4m 



in ( 24.8 ) nor our composition rule ( 24.11 ) needs to change from before 



Moreover, the majority of our lexical meanings have nothing to do with 
focus and thus need not change either. For example, in the hypothetical 
lexicon entry [Jo/m] = r/(John), the upgrade from meaning type e to mean- 
ing type Me occurs automatically due to the redefinition of r\. 

2.3 The reader monad; intensionality and variable binding 

Another monad often seen in computer science is the reader monad, also 
known as the environment monad. This monad encodes dependence of val- 
ues on some given input. To define the reader monad, fix a type p — say the 
type s of possible worlds, or the type g of variable assignments — then let 

Ma = p^a Va, (24.16a) 

7](a) = Xw.a: Ma Va : a, (24.16b) 

m-kk = Xw. k(m(w))(w) : Mf3 Vm : Ma, k : a -> M(3. (24.16c) 

Note how the definition of m-kk threads the input w through both m and k to 
produce the result. To see this threading process in action, let us once again 
substitute our monad definition ( |24.1(: ) into the definition of ^4m in ( |24.8j ) : 



A u (f)(x) = Xw. f(w)(x(w)) : M/3 V/ : M(a ->P),x: Ma. (24.17) 

For p = s, we can think of M as the intensionality monad, noting that ( |24.17| ) 
is exactly the usual extensional composition rule. While words such as 
student and know have meanings that depend on the possible world w, 
words such as is and and do not. We can upgrade the latter by applying r). 
For p = g, we can think of M as the variable binding monad, noting 



that ( 24.17 ) is the usual assignment-preserving composition rule. Except for 
pronominals, most word meanings do not refer to the variable assignment. 
Thus we can upgrade the majority of word meanings by applying n. 
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If we substitute the same monad definition (24.16) into the secondary 
composition rule ( |24.12| ), the result is 

A^(f)(x) = Xw. f(w)(x) :M/3 V/ : M(Ma ->P),x: Ma. (24.18) 

For p = s, this is the intensional composition rule; it handles sentence-taking 
verbs such as know and believe (of type s — > (s — > t) — > e — > i) by allowing 
them to take arguments of type s — > i rather than type i. The monad laws, 
by the way, guarantee that A^(f)(x) = Am(/)(?|(i)) for all / and x; the 
function n (in this case a map from s — ► t to s — > s — > t) is simply the 
intension (up) operator, usually written A . 



For p = g, the same formula (24.18|) is often involved in accounts of 



quantification that assume quantifier raising at LF, such as that in |Hcim| 



and Kratzer (1998 ). It handles raised quantifiers (of type g — > (g — > t) — > t) 
by allowing them to take arguments of type g — > t rather than type t. The 
function rj (in this case a map from g — > t to g — > g — > t) is simply the 
variable abstraction operator. 

2.4 The continuation monad; quantification 

Barker (2000) proposed an analysis of quantification in terms of continua- 
tions. The basic idea is to continuize a grammar by replacing each mean- 
ing type a with its corresponding continuized type (a — ► t) — ► t through- 
out. As a special case, the meaning type of NPs is changed from e to 
(e — > t) — > i, matching the original treatment of English quantification 



by pVIontague (1974|) . 

In general, for any fixed type w (say i), we can define a continuation 
monad with answer type u: 

Ma = (a^w)^w Va, (24.19a) 

77(a) = Ac. c(a) : Ma Va : a, (24.19b) 

m-kk = Xc. m(Xa. k(a)(c)) : Mf3 Vm : Ma, fc : a -> M/3. (24.19c) 

The value c manipulated in these definitions is known as the continuation. 
Intuitively, "the continuation represents an entire (default) future for the 



computation" (Kelsey, Clinger, and Rees 1998|). Each value of type Ma 



must turn a continuation (of type a — > uj) into an answer (of type uj). The 
most obvious way to do so, encoded in the definition of r\ above, is to feed 
the continuation a value of type a: 

\John\ = r/(John) = Ac. c(John) : Me, (24.20a) 
[smofes] = r/(smoke) = Ac. c(smoke) : M(e — > t). (24.20b) 

To compute the meaning of John smokes, we first substitute our monad 
definition ( [24.19] ) into the primary composition operation ^4m ( [24.8 ): 

A M {f)(x) = Xc. f(Xg.x(Xy.c(g{y)))) : 



(24.21) 

V/ : M(a -»■ 0), x : Ma. V 1 
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Letting / = ?](smoke) and x = ?7(John) then gives 

{John smokes} = Ac. r/(smoke) (Ay. y(John) (Ay. c(g(y)))) 
= Ac. y(John)(Ay. c(smoke(y))) = Ac. c(smoke(John)) : M.t. 

In the second step above, note how the term Ay. c(smoke(y)) represents the 
future for the computation of [Jo/m], namely to check whether he smokes, 
then pass the result to the context c containing the clause. If John smokes 
is the main clause, then the context c is simply the identity function id^. 
We define an evaluation operator e : Mw — > u by e{m) = m(id w ). Fixing 
uj = t, we then have e(JJohn smokes}) = smoke(John), as desired. 
Continuing to fix uj = t, we can specify a meaning for everyone: 

{everyone} = Ac. Mx.c(x) : Me. (24.22) 

This formula is not of the form Ac. c(. ..). In other words, the meaning 
of everyone non-trivially manipulates the continuation, and so cannot be 
obtained from applying n to an ordinary value. Using the continuized com- 



position rule (24.21), we now compute a denotation for everyone smokes: 

{everyone smokes} = Ac. ^(smoke) (Ay. {everyone} (Ay. c(y(y)))) 
= Xc.{everyone} (Ay. c(smoke(y))) = Ac. Vx. c(smoke(x)) : Mi, 

giving ei^everyone smokes}) = Vx.smoke(x) : t, as desired. 

The main theoretical advantage of this analysis is that it is a compo- 
sitional, in-situ analysis that does not invoke quantifier raising. Moreover, 
note that a grammar continuized is still a grammar — the continuized com- 



position rule ( 24.21 ) is perfectly interpretable using the standard machinery 
of Montague grammar. In particular, we do not invoke any type ambiguity 
or flexibility as proposed by Partee and Rooth (1983| ) and Hendriks (1993| ); 



the interpretation mechanism performs no type-shifting at "run-time" . 

This desirable property also holds of the other monadic analyses I have 
presented. For instance, in a grammar with intensionality, meanings that 
use intensionality (for example {student}) are identical in type to meanings 
that do not (for example {is}). The interpretation mechanism does not 
dynamically shift the type of is to match that of student. 

It is worth relating the present analysis to the computer science liter- 
ature. Danvy and Filinski (1990) studied composable continuations, which 



they manipulated using two operators "shift" and "reset". We can define 
shift and reset for the continuation monad by QWadler 1994 ) 



shift = Xh. Xc.e(h(Xa. Ac. c (c(o)))) : ((a -> Mw) -» Mw) -» Ma, (24.23) 
reset = Am. Ac. c(e(m)) : Mu -> Mw. (24.24) 

Assuming that the V operator is of type (e — > t) — > t, the meaning of 
everyone specified in ( 24.22| ) is simply shift(Ac.(r/ o V)(e o c)). To encode 



scope islands, Barker implicitly used reset. |Filinski (1999| ) proved that, in a 



certain sense, composable continuations can simulate monads. 
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3 Combining monads 



Having placed various semantic phenomena in a monadic framework, we 
now ask a natural question: Can we somehow combine monads in a mod- 
ular fashion to characterize interaction between semantic phenomena, for 
example between intensionality and quantification? 

Unfortunately, there exists no general construction for composing two 
arbitrary monads, say (Mi,7/i,*i) and (M2, 7721*2)) into a new monad of the 
form (M3 = Mi °M2,r/3,*3) ( King and Wadler 1993| ; Jones and Duponcheej 



1991 ). O ne might still hope to specialize and combine monads with addi- 



tional structure, to generalize and combine monads as instances of a broader 
concept, or even to find that obstacles in combining monads are reflected in 
semantic constraints in natural language. 

Researchers in denotational semantics of programming languages have 
made several proposals towards combining monadic functionality, none of 
which are completely satisfactory flMoggi 1990| ; (Steele 19*94 ; Liang, Hudak 



and Jones 1995; Espinosa 1995; [Filinski 1998| ). In this section, I will relate 



one prominent approach to natural language semantics. 



3.1 Monad morphisms 



One approach to combining monads, taken by Moggi, Liang et al., and 



Filinski| , is to compose monad morphisms instead of monads themselves. A 



monad morphism (also known as a monad transformer or a monad layering) 
is a map from monads to monads; it takes an arbitrary monad and trans- 
forms it into a new monad, presumably defined in terms of the old monad 
and supporting a new layer of functionality. For instance, given any monad 
(Mi,?7i,*i) and fixing a type p, the reader monad morphism constructs a 
new monad (M2, 772,^2)) defined by 

M 2 a = p -> Mia Va, (24.25a) 

772(a) = Xw. 771(a) : M 2 a Va : a, (24.25b) 

m-k 2 k = Xw. [m(w) *i Xa. k(a)(w)] : M 2/ 9 Vm : M 2 a, (24.25c) 

k : a -» M 2 0. 

If we let the old monad (Mi, 771, *i) be the identity monad, defined by Mia = 
a, 771(a) = a, and m*i k = k(m), then the new monad fl24.25|) is just the 
reader monad ( |24.16| ). If we let the old monad be some other monad — even 
the reader monad itself — the new monad adds reader functionality. 

By definition, each monad morphism must specify how to embed com- 
putations inside the old monad into the new monad. More precisely, each 
monad morphism must provide a function (pronounced "lift") 

I : Mia -> M 2 a, (24.26) 
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polymorphic in a.[] For the reader monad morphism, I is defined by 

£(m) = Xw.m: M 2 a Vm : Mia. (24.28) 

The continuation monad also generalizes to a monad morphism. Fix- 
ing an answer type cj, the continuation monad morphism takes any monad 
(Mi,77i,*i) to the monad (M 2 ,?7 2 ,* 2 ) defined by 

M 2 a = (a -> Miw) -> Mia> Va, (24.29a) 

% (a) = Ac. c(a) : M 2 a Va : a, (24.29b) 

m*2k = Ac. m(Aa. fc(a)(c)) : M 2 /3 Vm : M 2 a, k : a -> M 2 ^. (24.29c) 

The lifting function £ for the continuation monad morphism is defined by 

£(m) = Ac.(m *! c) : M 2 a Vm : Mia. (24.30) 

Monad morphisms can be freely composed with each other, though the 
order of composition is significant. Applying the continuation monad mor- 
phism to the reader monad is equivalent to applying to the identity monad 
the composition of the continuation monad morphism and the reader monad 
morphism, and yields a monad with type constructor Ma = (a —>/?—> u) — > 
p — > u>. Applying the reader monad morphism to the continuation monad is 
equivalent to applying to the identity monad the composition of the reader 
monad morphism and the continuation monad morphism, and yields a dif- 
ferent monad, with type constructor Ma = p — > (a — > to) — > to. 



3.2 Translating monads to monad morphisms 

The monad morphisms ( |24,25| ) and ( |24.29| ) may appear mysterious, but we 
can in fact obtain them from their monad counterparts ( 24.16[) and ( 24.19|) 



via a mechanical translation. The translation takes a monad (Mo, 7701*0) 
whose r]o and *o operations are A-terms, and produces a morphism mapping 
any old monad (Mi,77i,*i) to a new monad (M 2 ,?? 2 ,* 2 ). The translation is 
defined recursively on the structure of A-types and A-terms, as follows.0 

Every type r is either a function type or a base type. A function type 
has the form t\ — > t 2 , where r\ and t 2 are types. A base type t is a type 
fixed in Mo (p and u> in our cases), a polymorphic type variable (a and (5 

3 By definition, I must satisfy naturality: 

I (rn (a)) = 772(a) Va:a, (24.27a) 

l(mM k) =l(m)*2 {£ok) Vm : Mia, k : a Mi p. (24.27b) 

4 Among other things, the translation requires Mo to be defined as a A-type, and r/o 
and *o to be defined as A-terms. Thus the translation cannot apply to the powerset and 
pointed powerset monads (see footnote ^) . Nevertheless, any monad morphism (including 
ones produced by the translation) can be applied to any monad (including these two 
monads) . 
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as appearing in r\ and * (24.2) and t ( |24.26| )), or the terminal type ! (also 
known as the unit type or the void type). For every type r, we recursively 
define its computation translation [Y] and its value translation [t\ : 

\i] = Mit, [l\ = h In -> T 2 ] = In -» r 2 J = LnJ -> [r 2 l , (24.31) 



where i is any base type. 

Each term e : r is an application term, an abstraction term, a variable 
term, or the terminal term. An application term has the form (ei : t\ — > r 2 ) 
(e 2 : T\) : r 2 , where e\ and e 2 are terms. An abstraction term has the form 
(Ax : T\. e : r 2 ) : — > r 2 , where e is a term. A variable term has the form 
x : r, where x is the name of a variable of type r. The terminal term is ! : ! 
and represents the unique value of the terminal type !. For every term e : r, 
we recursively define its term translation \e\ : pr] : 



[~(ei : i -> n -> • • ■ -> r n -» 

Xyi : [nj Xy n ■■ [r n \ 

\(e 1 : (n -> t 2 ) -> T 3 )(e 2 



(24.32a) 
(24.32b) 
(24.32c) 
(24.32d) 
(24.32e) 
(24.32f) 



(24.33) 



0(ea : 01 = 

.[[e 2 ] *i (Ay : t- M (y ) • • • (Vn))]- 
■ n -> t 2 )] = [ei] (real), 
[Ax : r. e] = Ax : [tJ . [e] , 
\x : t] = r]i(x), 
\x : r\ — » t 2 ] = x, 

where t and t' are any base types, and yo, ■ ■ ■ ,y n are fresh variable names. 
Finally, to construct the new monad (M 2 ,ry 2 ,* 2 ), we specify 

M 2 a = \M a] , n 2 = [r/ ] , * 2 = P*ol , 
l(m) = \Xf :!—>«. %(/(!))] (A! : !. m) Vm : Mia. 

To illustrate this translation, let us expand out M 2 and * 2 in the spe- 
cial case where (Mo,r/o,*o) is the reader monad ( |24.16 ). From the type 
translation rules ( |24.31| ) and the specification of M 2 in ( 24.33| ), we have 

M 2 a =\p->ct\ = [p\ -+\a]=p-> Mia, 

matching ( 24.25a ) as desired. From the term translation rules ( [24.32 ) and 
the specification of * 2 in ( [24.33 ) , we have 

★2 = [Am : p — > a. Xk : a —> p —* (3. Xw : p. k(m(w)} (w)~\ by ( |24.16c|) 

= Am : p -» Mia. Xk : a -> p -> Mi/9. Aw : p. pfe(m(u;)) (10)] by ( p4.32cj ), 
in which 

[fc(m(«;))(w)] = mH *i Xy : p. \k(m(w))] (y ) by (ggjgg) , ([2432dD 
= \k(m(w))] (w) by (p43aD 

= (M *i Ay : p- M (y )) *i Ay : a. [¥| (y )(w) by (|24.32a|) 
= (7/1^) *i Ay : P- m{y )) *i Xy : a. k{y )(w) by (p432d[) , ( gQgp 
= m(to) *i Ay : a- k(y )(w) by (|24.3aj) , 
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matching Q24.25c| ) as desired. 

The intuition behind our translation is to treat the A-calculus with which 
(Mo,77q,*o) is defined as a programming language whose terms may have 
computational side effects. Our translation specifies a semantics for this 
programming language in terms of (Mi , rjx , *i) that is call- by- value and that 
allows side effects only at base types. That the semantics is call-by-value 
rather than call- by-name is reflected in the type translation rules ( 24.31]) , 



where we define |ri — > ti\ to be — > [7-2] rather than pn] — > [Y2] . That 
side effects occur only at base types is also reflected in the rules, where we 
define \t\ — > T2] to be [ti - > r 2j rather than Mi \t\ — > T2J ■ Overall, our 



translation is a hybrid between the call- by-name Algol translation ( Benton 



tion (|Wadler 1992a| , §8; |Benton et aL| , §3.1.3). 



Hughes, and Moggi 200C, §3.1.2) and the standard call-by-value transla- 



3.3 A call-by-name translation of monads 

Curiously, the semantic types generated by monad morphisms seem some- 
times not powerful enough. As noted at the end of § |3.1| , the reader and 
continuation monad morphisms together give rise to two different monads, 
depending on the order in which we compose the monad morphisms. Fixing 
p = s for the reader monad morphism and u = t for the continuation monad 
morphism, the two combined monads have the type constructors 

M cr a = (a -> s -> t) -> s -> t, M rc a = s (a -► t) t. (24.34) 

Consider now sentences such as 

John wanted to date every professor (at the party). (24.35) 

This sentence has a reading where every professor at the party is a person 
John wanted to date, but John may not be aware that they are professors. 
On this reading, note that the world where the property of professorship is 
evaluated is distinct from the world where the property of dating is eval- 
uated. Therefore, assuming that to date every professor is a constituent, 
its semantic type should mention s in contravariant position at least twice. 
Unfortunately, the type constructors M cr and M rc ( 24.34] ) each have only 



one such occurrence, so neither M. cr t nor M. rc t can be the correct type. 
Intuitively, the semantic type of to date every professor ought to be 

((s -> t) -> s -> t) -> s -> t (24.36) 

or an even larger type. The type ( 24.36|) is precisely equal to M cr (s — > t), 



but simply assigning M cr (s — > t) as the semantic type of to date every pro- 
fessor would be against our preference for the reader monad morphism to 
be the only component of the semantic system that knows about the type s. 
Instead, what we would like is to equip the transformation on types taking 
each a to ((s — > a) — » s — > tj s — > t with a composition method for 
transformed values. 
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One tentative idea for synthesizing such a composition method is to 



replace the call- by- value translation described in §3^ with a call- by-name 
translation, such as the Algol translation mentioned earlier ( [Benton et al.| , 
§3.1.2)|] For every type r, this translation recursively defines a type [prj]: 

[M]=M lt) Kn^M = M^M, (24.37) 

where i is any base type. For every term e : r, this translation recursively 
defines a term [[e]] : f[r]] : 

N(e 2 )ll = M(M), W = ^ f2438) 

px:r.e| = Ax: M.H, ^ = »7l(0- 
Applying this translation to a monad (Mo,?70)*o) gives the types 

M : Mia ->• [jMoaH, (24.39a) 

M : (Mia -» |JMo^) - f[M /^. (24.39b) 

If we let (Mo,?70)*o) be the continuation monad ( |24.19[ ) and (Mi,r/i,*i) be 
the reader monad ( |24.16 ), then the type t transformed is 



[{MotH = (Mi* -> Mit) -> Mit = ((s -> t) -> s -> t) -> s -> t, 

as desired. However, unless (Mi,r/i,*i) is the identity monad, the types 
in ( p4,39j ) do not match the definition of monads in ( |24.2| ). In other words, 
though our call-by-name translation does give the type transform we want 
as well as a composition method in some sense, its output is not a monad 
morphism. 

4 Conclusion 

In this paper, I used monads to characterize the similarity between several 
semantic accounts — for interrogatives, focus, intensionality, variable bind- 
ing, and quantification.^] In each case, the same monadic composition rules 
and mostly the same lexicon were specialized to a different monad. The 
monad primitives r/ and * recur in semantics with striking frequency. 

It remains to be seen whether monads would provide the appropriate 
conceptual encapsulation for a semantic theory with broader coverage. In 
particular, for both natural and programming language semantics, combin- 
ing monads — or perhaps monad-like objects — remains an open issue that 
promises additional insight. 

Acknowledgments Thanks to Stuart Shieber, Dylan Thurston, Chris 
Barker, and the anonymous referees for helpful discussions and comments. 



Anoth er possible tr anslation is the standard ("Haskell") call-by-name one (Wadler 



1992a , §8; Benton et al. , §3.1.1). It produces strictly larger types than the Algol call-by- 
name translation, for instance s — > (s —> (s — > a) — > s — > t) — > s — > t. 

6 Other phenomena that may fall under the monadic umbrella include presuppositions 
(the error monad) and dynamic semantics (the state monad). 
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